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ABSTRACT 

The analysis of the Lyman-alpha (Lya) forest of absorption lines in quasar spectra has emerged as a 
potentially powerful technique to constrain the linear matter power spectrum. In most previous work, 
the amplitude of the ionizing background was fixed by calibrating simulations to match the observed 
mean transmitted flux in the Lya forest. This procedure is undesirable in principle as it requires 
the estimation of the unabsorbed quasar continuum level, a difficult undertaking subject to various 
sources of systematic error and bias. We suggest an alternative approach based on measuring the one- 
point probability distribution function (pdf) of the fluctuations in the flux about the mean, relative 
to the mean, i.e. the pdf of Sf = (/ — (/})/(/}■ This statistic, while sensitive to the amplitude of the 
ionizing background, has the virtue that its measurement does not require an estimate of the unabsorbed 
continuum level. We present a measurement of the pdf of Sf from seven Keck HIRES spectra, spanning 
a redshift range of z = 2.2 — 4.4. To illustrate that our method is useful, we compare our measurements 
of the pdf of Sf, and measurements of the flux power spectrum from Croft et al. (2002) at z — 2.72, 
with cosmological simulations. From this comparison, we obtain constraints on the mean transmission 
in the Lya forest, the slope of the temperature-density relation, as well as the amplitude and slope of 
the mass power spectrum. Our methodology will be useful for obtaining more precise constraints with 
larger data samples from the Sloan Digital Sky Survey (SDSS). 

Subject headings: cosmology: theory - intergalactic medium - large-scale structure of universe; quasars 
- absorption lines 



1. INTRODUCTION 

Tremendous progress has been made recently in using 
[ the Lya forest as a cosmological probe. In the emerging 
theoretical picture of the Lya forest, most of the struc- 
ture in the forest is attributed to the gravitational in- 
stability. The physics of the absorbing gas is simple; it 
is just that of gas in photoionization equilibrium with a 
spatially-homogeneous radiation field. On large scales the 
hydrogen gas distribution follows the dark matter distri- 
bution, and on small scales it is Jeans pressure-smoothed 
(see e.g., Cen et al. 1994, Zhang et al. 1995, Hernquist 
et al. 1996, Miralda-Escude et al. 1996, Muecket et al. 
1996, Bi & Davidsen 1997, Bond & Wadsley 1997, Hui et 
al. 1997, Croft et al. 1998, Theuns et al. 1999, Bryan 
et al. 1999, Nusser & Haehnelt 1999). In this theoretical 
picture, each quasar spectrum essentially provides a one- 
dimensional map of the density field in the intergalactic 
medium (IGM) at z ~ 3, implying constraints on the am- 
plitude and slope of the linear matter power spectrum at 
z ~ 3 and scales of k ~ 0.1 — 5h Mpc -1 (see e.g., Croft et 
al. 1998, McDonald et al. 2000, Croft et al. 2002). 

This is a particularly valuable probe since one can in- 
fer the linear power spectrum of density fluctuations on 
scales smaller than those examined by current Cosmic Mi- 



crowave Background (CMB) experiments and galaxy sur- 
veys (Croft et al. 1998). The measurements of the Lya 
forest flux power spectrum thus provide an important com- 
plement to other probes of large-scale structure, when con- 
straints on small-scale clustering from the Lya forest are 
combined with constraints on large scales from CMB ex- 
periments. The leverage from the large range of scales 
probed allows for constraints on neutrino mass (Croft, 
Hu & Dave 1999, McDonald 2004a, 2004b, Seljak et al. 
2004), on warm dark matter (Narayanan et al. 2000, Viel 
et al. 2005), and on the possibility that the primordial 
power spectrum deviates from a pure power law (Spergel 
et al. 2003, Seljak, McDonald, & Makarov 2003, hereafter 
SMM03, McDonald 2004a, 2004b, Seljak et al. 2004, Viel 
et al. 2004c). 

However, to obtain reliable constraints on the linear 
matter power spectrum from the Lya forest, a range of 
sources of systematic error need to be addressed in more 
detail. One such source of error is that most previous anal- 
yses relied on estimating the unabsorbed quasar contin- 
uum level, i.e., the quasar flux in the absence of absorption 
from the Lya forest. In particular, measurements of the 
mean transmitted flux through the Lya forest were used to 
calibrate the amplitude of the photo-ionizing background, 
a crucial input parameter in the theoretical modeling of 
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the Lya forest. This amplitude can only be constrained 
from observations in an indirect manner, and it is not feasi- 
ble to predict this quantity from first principles with any- 
where close to the required accuracy. Unfortunately as 
SMM03 emphasize, it is difficult to robustly measure the 
mean transmitted flux in the Lya forest and hence to con- 
strain the amplitude of the ionizing background with high 
accuracy. 

In order to elaborate further, we briefly describe the 
measurements of the mean transmitted flux and their dif- 
ficulties. Two main approaches have been used to estimat- 
ing the unabsorbed continuum level (sec e.g., SMM03). 
One approach is to extrapolate the continuum from the 
red side of the Lya forest, where there is no Lya absorp- 
tion. This strategy has been applied by Press, Rybicki, 
& Schneider (1993), (hereafter PRS93), and by Bernardi 
et al. (2003). The other method is to locate regions in 
the Lya forest where there is apparently no absorption, fit 
these with a high-order polynomial, and call this the con- 
tinuum level (Rauch et al. 1997, McDonald et al. 2000). 

Each approach has systematic difficulties. The first may 
cause an underestimate of the mean transmitted flux: This 
is because, as SMM03 point out, there may be a break 
in the slope of the quasar continuum near the Lya emis- 
sion line; the continuum becoming less steep as one moves 
across the Lya line from red to blue, as seen in low redshift 
quasar spectra (Zheng et al. 1997, Telfer et al. 2002). 1 
Furthermore, the slope of the quasar continuum varies 
significantly from quasar to quasar (Telfer et al. 2002). 
The fitting together of 'unabsorbed' regions in the sec- 
ond method adds an undesirable subjective element to the 
data analysis, and inevitably breaks down at sufficiently 
high redshift where there are no, or very few, unabsorbed 
regions in the Lya forest. This approach tends to overesti- 
mate the mean transmitted flux in the Lya forest. There 
is yet another difficulty with this type of continuum fitting. 
The difficulty arises because quasar spectra are typically 
taken with an echcllograph. The full quasar spectrum is 
then pieced together from several individual echelle orders, 
with the signal to noise of the spectrum dropping off near 
the edges of each echelle order (see e.g. Hui et al. 2001). 
This imprints a periodic structure in the noise, an added 
obstacle in continuum estimation. 

In contrast to the measurements of the mean transmit- 
ted flux, the unabsorbed quasar continuum level does not 
need to be estimated in order to measure the flux power 
spectrum. In measuring the flux power spectrum, the 
power spectrum of the fluctuations in the flux about the 
mean, relative to the mean is the quantity of interest. That 
is, the power spectrum of Sf = (/ — (/))/(/) is estimated. 
To determine this quantity observationally, the mean flux 
(/) (Hui et al. 2001, Croft et al. 2002) can be measured 
directly, without first rescaling to the unabsorbed contin- 
uum level. This statistic is sensitive to the shape of the 
continuum, since the continuum, and hence the mean flux, 

lr The exact location of the break is poorly determined, due to 
the large number of emission lines near the break point. Telfer et al. 
(2002) indicate only that the break occurs between 1200 — 1300A. On 
the other hand, Zheng et al. (1997) found, with a smaller sample, 
that the break occurs near 1050A, which would be irrelevant for 
determining the mean flux in the Lya forest. The reason for the 
difference in the position of the break point between the two studies 
is not clear. 



varies slowly across a quasar spectrum, but it is not sen- 
sitive to the normalization of the quasar continuum. It is 
thereby preferable to have a method of calibrating the am- 
plitude of the ionizing background that does not depend 
on estimating the normalization of the quasar continuum. 

Furthermore, when measurements of the flux power 
spectrum are combined with measurements of the mean 
transmitted flux, the resulting constraints on the linear 
matter power spectrum depend sensitively on the assumed 
mean transmitted flux (Croft et al. 2002, Zaldarriaga et 
al. 2001a, Zaldarriaga et al. 2003, SMM03). This is be- 
cause the effect of changing the amplitude of the ionizing 
background on the flux power spectrum is rather degen- 
erate with the effect of varying the linear matter power 
spectrum. 

Additional information, beyond that contained in the 
flux power spectrum, is therefore necessary to simultane- 
ously constrain the linear matter power spectrum and the 
amplitude of the ionizing background. McDonald et al. 
(2004b), however, note that variations in the linear power 
spectrum and the mean transmission, affect the flux power 
spectrum differently at different redshifts. Therefore they 
can break the degeneracy that we mention using only mea- 
surements of the flux power spectrum, by virtue of the long 
redshift span and high statistical precision of data from 
the SDSS. For this to work, they must assume that the 
mean transmission and other nuisance parameters evolve 
smoothly with redshift. The technique we will describe 
below can provide a useful check of these assumptions, 
and possibly tighten their constraints further. For this 
purpose, an alternative to the usual source of 'additional 
information', the mean transmitted flux, is desirable. 

Croft et al. (2002) suggest one such alternative method 
for calibrating the amplitude of the ionizing background 
based on measuring the number of times the flux crosses a 
given threshold in an observed quasar spectrum and com- 
paring this with the threshold crossing frequency in simu- 
lated quasar spectra. In this paper, we take this suggestion 
a step further by measuring the full probability distribu- 
tion function (pdf) of 5f, which also does not require an 
estimate of the unabsorbed continuum level. We find that 
this quantity itself is sensitive to the amplitude of the ioniz- 
ing background, and we thereby sidestep one of the biggest 
loopholes in parameter estimation from the Lya forest - by 
going straight from quasar spectra to parameter constraints 
without continuum fitting. 

This extends previous work which measured the prob- 
ability distribution of flux (normalized to the unabsorbed 
continuum level) and compared to simulations. In particu- 
lar, Rauch et al. (1997) and McDonald et al. (2000), found 
that canonical ACDM models provide a good match to the 
observed probability distribution of the flux (normalized to 
the unabsorbed continuum level) , and Gaztanaga & Croft 
(1999) studied the one-point flux pdf in the context of 
the gravitational instability model of the Lya forest. Des- 
jacques and Nusser (2004) also recently emphasized the 
advantage of using the pdf of the flux (normalized to the 
unabsorbed continuum level) along with the flux power 
spectrum to constrain the linear matter power spectrum. 

In the present paper, we first measure the pdf of Sf, 
and demonstrate that this quantity is insensitive to the 
behavior of the quasar continuum. Next, we combine our 
measurement of the pdf of Sf with the flux power spec- 
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trum measurements of Croft et al. (2002) at z = 2.72. 
Our intention here is simply to illustrate that the pdf of 
Sf is useful, when employed in conjunction with the usual 
flux power spectrum, for constraining cosmological param- 
eters and the parameters that describe the physics of the 
IGM. Our measurements of the pdf of Sf can be com- 
bined with more precise measurements of the flux power 
spectrum from SDSS data (from McDonald 2004a, or Hui 
et al. in prep.). In addition, the pdf of Sf can itself be 
well-measured from SDSS data (Burgess, Buries et al. in 
prep.). Measurements of the pdf of Sf may be regarded as 
part of a larger effort to incorporate higher order statistics 
into the analysis of the Lya forest, as suggested by Zaldar- 
riaga et al. (2001b), Mandelbaum et al. (2003), and Viel 
et al. (2004a). The pdf, as a one point statistic, is a sensi- 
ble starting place for incorporating higher-order statistics 
into the analysis of the Lya forest. 

The outline of the paper is as follows. In §2 we illus- 
trate the degeneracy between the amplitude and slope 
of the mass power spectrum and the mean transmitted 
flux in the Lya forest, reiterating the main points of 
SMM03. In §3 we present a measurement of the pdf of 
Sf from Keck HIRES data and show that our measure- 
ment is robust to the treatment of the quasar continuum. 
In §4 we illustrate how the measurement of the pdf of 
Sf helps to tighten constraints on the slope and ampli- 
tude of the mass power spectrum. In §5 we present con- 
straints on the slope of the temperature-density relation, 
and in §6 we conclude. In the Appendix, we present some 
details regarding our simulations, and examine the con- 
vergence of our results with respect to simulation resolu- 
tion and box size. Tables of the measurements presented 
in this paper are available electronically at the website 
http://t8web.lanl.gov/people/heitmann/lyma/. 

2. DEGENERACY BETWEEN THE AMPLITUDE AND 
SLOPE OF THE MATTER POWER SPECTRUM AND THE 
MEAN TRANSMITTED FLUX 

In this section we illustrate the degeneracy between the 
amplitude of the mass power spectrum and the mean flux 
in the Lya forest. We first demonstrate that in the absence 
of assumptions about the mean flux in the Lya forest, only 
very weak constraints on the amplitude and slope of the 
linear mass power spectrum are obtained (SMM03). We 
illustrate our points using observational measurements of 
the flux power spectrum from Croft et al. (2002) at a 
rcdshift of (z) = 2.72. In order to introduce notation, we 
provide a very brief recap of theoretical models of the Lya 
forest. For more details the reader is referred to e.g, Hui 
et al. (1997). 

Assuming photo-ionization equilibrium, the optical depth 
to Lya absorption is given by 

T = A(l + S) 2 -°- 7a . (1) 

Here S is the gas over-density and a is the power law in the 
temperature-density relation, i.e., T = T (l + S) a . 2 The 
parameter A is proportional to (e.g., Rauch et al. 1997, 

2 Here T is the temperature at ovcrdensity S, and To is the tem- 
perature at the cosmic mean density. The temperature of the low 
density gas in the IGM at z ~ 3 is expected to be tightly correlated 
with its over-density (Hui & Gnedin 1997). 



Hui et al. 2001) 

A (1 + Z f, (2) 

1 HI tl{Z) 

where fib is the baryon density in units of the critical den- 
sity, H (z) is the Hubble parameter at redshift z, Hq is the 
Hubble parameter today, h is H o /100 km/s/Mpc, and Thi 
is the photoionization rate of hydrogen, which is directly 
proportional to the amplitude of the ionizing background. 
The parameter A is determined by a product of cosmolog- 
ical parameters, which are known to high accuracy, with 
the combination T _0 ' 7 /Fhi, which is uncertain. The opti- 
cal depth is then shifted into redshift space and convolved 
with a thermal broadening window. The flux transmitted 
through the Lya forest is given by F/F c = e~ T , and the 
amount of flux absorbed by the Lya forest is 1 — F/F c . 
Here Fj F c denotes the flux normalized to the unabsorbed 
quasar continuum level. Our model of the Lya forest is 
then complete given the baryonic matter density and pe- 
culiar velocity fields specified by our numerical simulations 
for a given cosmological model (see the Appendix for de- 
tails regarding simulations). 3 

Our methodology for obtaining constraints on the mat- 
ter power spectrum and the physics of the IGM from 
this modeling is very similar to that of Zaldarriaga et al. 
(2001a, 2003) (see also Lidz et al. 2003). In brief, we 
generate the flux power spectrum and pdf for a large grid 
of simulated models, calculate the likelihood of the model 
given the data, and marginalize over nuisance parameters 
to obtain constraints on the amplitude and slope of the 
linear matter power spectrum. For this purpose, we ran a 
grid of models describing the IGM, with parameter vector 
(a, n, To, a, (F/F c )). Our grid covers the following range 
in each parameter: 

• a= (0.0919,0.1301,0.1479,0.1682,0.1914,0.2178, 
0.2480, 0.2827, 0.3227, 0.3691, 0.4236, 0.5327) 

• n= (0.7,0.8,0.9,1.0,1.1,1.2,1.3) 

• T = (200,250,300,350,400, 
450) (km/s) 2 

• a = (0.0,0.1,0.2,0.3,0.4,0.5,0.6) 

• (F/F c ) = (0.670,0.680,0.685,0.690,0.695,0.700, 
0.705, 0.710, 0.715, 0.720, 0.725, 0.730, 0.735, 0.740, 
0.745, 0.750, 0.755, 0.760, 0.770, 0.780) 

The parameters of our simulated model are : a, the 
scale factor of the simulation output, which effectively cor- 
responds to the normalization of the matter power spec- 
trum; n, the slope of the primordial power spectrum, each 
slope corresponding to a different linear power spectrum 
slope on the scales probed by the Lya forest; T , the tem- 
perature at mean density; 4 a, the slope of the tempera- 
ture density relation and (F/F c ), the mean transmission 
in the Lya forest. In this paper we generally parameter- 
ize our model in terms of the mean transmission, (F/F c ), 

3 We generate the baryonic density field using an implementation 
of HPM, the pseudo-hydrodynamic recipe proposed by Gnedin & 
Hui (1998). 

4 The temperature is given in units of (km/s) 2 , which is related 
to the temperature in K by T = T [(km/s) 2 ] X 10 4 /165 K. 
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although varying this parameter corresponds to varying 
A and hence the amplitude of the ionizing background 
[Eqn. (2)]. We emphasize that, in later parts of the pa- 
per, we do not use measurements of the mean transmission 
to fix the amplitude of the ionizing background. In spite 
of this we will still parameterize our model in terms of 
(F/F c ), rather than in terms of the amplitude of the ion- 
izing background. This will be convenient for comparing 
our results with different measurements of (F/F c ) that ap- 
pear in the literature. In addition to these parameters, we 
need to specify a thermal history in order to calculate the 
gas pressure term in our HPM simulations. We describe 
this in the Appendix. 

In this paper we use large volume simulations, which 
lack sufficient resolution to completely resolve the small- 
scale structure in the Lya forest. Hence, in comparing our 
simulated models of the flux power spectrum with Croft 
et al. (2002) 's measurements, we confine the compari- 
son to scales of k <J 0.02 s/km. 5 In the Appendix, we 
demonstrate that even restricting our analysis to these 
scales our simulation resolution is somewhat marginal, 
and we attempt to quantify the resulting systematic er- 
ror. We will generally quote constraints on the amplitude 
and slope of the matter power spectrum as constraints on 
the dimensionlcss amplitude of the power spectrum at red- 
shift z and scale k p = 0.03 s/km, A 2 = k^P(k p ,z)/2TT 2 , 
and the logarithmic slope at the same scale and redshift, 
n e ff = dinP (k p ) / dink (Croft et al. 2002). For reference, at 
z = 2.72, the central redshift of the data sample in Croft 
et al. (2002), the scale k v corresponds to a co- moving 
scale of k = H(z = 2.72)/(l + z)k p , which is k = 3.24/i 
Mpc -1 for a flat, ACDM cosmology with matter density 
O ro = 0.3. 

Next, we have to consider the priors we should adopt 
on the different nuisance parameters involved in the mod- 
eling. For the most part, we will be fairly conservative in 
adopting these priors. The temperature of the IGM, how- 
ever, has been constrained using simulations with smaller 
volume, but higher resolution than our present simulations 
(see, e.g., McDonald et al. 2001, Zaldarriaga et al. 2001a). 
The most pronounced effect of gas temperature on the flux 
power spectrum is a small-scale smoothing on scales of 
k ;> 0.05 s/km. Since these scales are not fully resolved by 
our simulations, we will not attempt to constrain T with 
the present simulations. Motivated by results from the 
higher resolution simulations, however, we adopt a prior 
on the temperature of the IGM 6 of T = 300 ±50 (km/s) 2 . 
(We will relax our prior on T in §5 to demonstrate that 
our constraint on the temperature-density relation, a, is 
not sensitive to the prior on To.) This prior on the temper- 
ature corresponds to the constraint found by McDonald et 
al. (2001), except with an error bar approximately twice 
as large. Conservatively, we will let a vary freely over the 
entire physically allowed range of a = 0.0 — 0.6 (Hui & 
Gnedin 1997). There is one sense in which we are not con- 
servative, however: in calculating the gas pressure term in 

5 Our numerical simulations were run primarily for comparison 
with quasar spectra from the SDSS. These spectra have relatively 
poor spectral resolution, motivating us to sacrifice simulation reso- 
lution for simulation volume. 

6 In the present paper, we limit our theoretical modeling to mod- 
eling at a redshift of z = 2.72. This prior, and those listed subse- 
quently, thus refer to the priors that we adopt at z = 2.72. 



our HPM simulations, we fix the thermal history of the 
IGM (see the Appendix for details). This was done partly 
to reduce computational burden, but also because, on the 
large scales we consider here (k <J 0.02 s/km), the effects 
of gas pressure smoothing are likely to be sub-dominant 
compared to the size of Croft et al. (2002) 's error bars. 

With this in mind, we generate the flux auto spectrum 
for each model in our grid and compute the likelihood of 
the model given the data. We show the resulting likeli- 
hood contours in §4. Presently, we illustrate the degener- 
acy between (F/F c ) and the amplitude and slope of the 
mass power spectrum, with example model fits from our 
grid. The model fits each have very different mass power 
spectrum amplitudes and slopes, compensated by rather 
different values of the mean flux, and somewhat by small 
changes in the other nuisance parameters. This is shown 
in Figure 1. In this plot, one can see that, assuming noth- 
ing about the mean flux in the Lya forest, the amplitude 
of the mass power spectrum is uncertain by an order of 
magnitude, and the slope of the mass power spectrum has 
an error of at least 20%. While the range of values for 
the mean transmitted flux considered in the figure is per- 
haps excessively conservative, a wide range of mean trans- 
mitted flux priors have been considered in the literature. 
Specifically, the prior on the mean transmission adopted 
by Croft et al. (2002), is (F/F c ) = 0.705 ± 0.012, while 
SMM03 consider two priors based on the measurements 
of McDonald et al. (2000): (F/F c ) = 0.742 ± 0.012, and 
one with expanded error bars, (F/F c ) = 0.742 ± 0.027, 
and Viel et al. (2004b) adopt (F/F c ) = 0.730 ± 0.011. 7 
The best fit amplitude of the mass power spectrum with 
the Croft et al. (2002) prior on the mean transmitted 
flux differs by a factor of ;> 2 than that obtained with the 
SMM03 prior (see the models in Figure 1). Furthermore, 
SMM03 show that the error contours become very large 
with their prior on the mean transmitted flux, in compar- 

7 The Croft et al. (2002) prior is based on measurements by 
PRS93, who give a power law fit to the effective optical depth, 
t cB = -ln(e- T ) = A(l + z)l +1 , with error bars of A = 0.0f75 - 
0.00567 ± 0.0002 and 7 = 2.46 ± 0.37. It is not exactly clear how 
to interpret these error bars: there has been some debate over what 
the PRS93 fit implies for the central value of (F/F c ) and its error 
bar. The prior adopted by Croft et al. (2002) corresponds to us- 
ing the central values of A and 7 to determine the central value of 
(F/F c ), but the rationale behind their assumed error bar on (F/F c ) 
is unclear. SMM03, on the other hand, argue that assuming that A 
and 7 are Gaussian-distributed actually implies a larger central value 
for (F/Fc). Meiksin & White (2004), with a still different interpre- 
tation of the same error bars, also argue for a larger central value 
for (F/Fc). SMM03 suggest that their interpretation of the PRS93 
measurements brings them in closer agreement with measurements 
from McDonald et al. (2000) (see above for the mean transmitted 
flux that SMM03 adopt). However, in addition to the uncertainty 
involved with interpreting the PRS93 measurements, there are fur- 
ther reasons to be hesitant about concluding that the different mea- 
surements of (F/F c ) arc consistent with each other. First, as we 
mentioned earlier, the same authors argue that the PRS93 measure- 
ments, by extrapolating the unabsorbed continuum level from the 
red side of the Lya emission line, should be biased low due to a 
break in the slope of the continuum near the Lya emission line. Sec- 
ond, Bernardi et al. (2003) use a method similar to that of PRS93, 
(yet more sophisticated), to estimate (F/F c ) and find (F/F c ) ~ 0.70 
at z ~= 2.72, with ~ 2% error bars. Presently we remain agnostic as 
to which measurement of (F/F c ) is most accurate. Our goal is just 
to illustrate the sensitivity of matter power spectrum constraints to 
different assumptions about the mean flux that have appeared in the 
literature. In §4, however, we obtain constraints on the mean flux in 
the Ly« forest using our measurements of the pdf of Sf. 
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Fig. 1. — Auto spectrum from Croft et al. (2002) at 
(z) = 2.72 compared to models that compensate for their very 
different mass power spectrum amplitudes and slopes by hav- 
ing different mean transmissions. fn our subsequent analysis, 
we only compare theoretical models with data on scales of k 
less than 0.02 s/km owing to our limited simulation resolu- 
tion. The models are completely specified by the parameter 
vector (A 2 , n e ff , To, a, (F/F c )). The models shown are speci- 
fied by the following parameter sets: (0.37,-2.68,350,0.5,0.680), 
(0.71, -2.58, 300, 0.4, 0.720), (1.04, -2.48, 300, 0.4, 0.735), 

(1.53, -2.38, 300, 0.4, 0.750), (3.29, -2.18, 300, 0.3, 0.770). 

ison with the error contours corresponding to the Croft et 
al. (2002) prior. The flux field becomes primarily sensitive 
to larger and larger over-densities as the mean transmitted 
flux increases, and one loses sensitivity to the linear mat- 
ter power spectrum (see also Zaldarriaga et al. 2003). The 
figure also illustrates that including information at high fc, 
(we only compare theory with data on scales of k <J 0.02 
s/km due to the limited resolution of our simulations), 
would only help slightly in obtaining tighter constraints. 

The lesson here, reiterating the main point of SMM03, 
as well as cautions mentioned by Croft et al. (2002), is that 
one needs a tight constraint on the mean transmission in 
the Lya forest in order to derive constraints on the mass 
power spectrum from the Lya forest flux power spectrum. 
There are a few possible ways to circumvent this difficulty. 
One approach is to determine any bias in estimates of the 
mean absorption obtained by fitting together 'unabsorbed' 
regions in the Lya forest using simulated spectra, as Rauch 
et al. (1997) and McDonald et al. (2000) attempted to 
do. An inherent difficulty is that mock spectra are typ- 
ically produced in simulation boxes that are small com- 
pared to the length of spectra over which the continuum- 
fitting procedure is performed. Furthermore, the size of 
the estimated bias may be model-dependent. Another ap- 
proach might be to correct estimates of the unabsorbed 
quasar continuum level extracted from the red side of Lya 
based on the break found in low redshift quasar spectra. 



However, the correction depends on the position of the 
break, which is poorly determined from the low redshift 
quasar spectra. As we mentioned in the Introduction, Mc- 
Donald et al. (2004b) circumvent the degeneracy we men- 
tion by exploiting the fact that the flux power spectrum 
depends differently on the modeling parameters at differ- 
ent redshifts. The disadvantage of this approach is that 
they must assume that the nuisance parameters evolve 
smoothly with redshift. Yet a different approach, which 
we pursue presently, is to avoid using the mean transmit- 
ted flux at all, and to instead calibrate the amplitude of 
the ionizing background, [or equivalently the parameter A 
in Eqn. (2)], using a different statistic entirely. 

3. MEASURING THE PROBABILITY DISTRIBUTION OF 
THE FLUX FIELD AND ITS ERROR BARS 

In this section, we present a measurement of the prob- 
ability distribution of the flux field, <5/, which will be use- 
ful in calibrating the parameter A in Eqn. (2), as we will 
demonstrate. We choose a particular estimator for 5f that 
we can measure robustly from the data sample as well as 
the numerical simulations. Our objectives here are to: 1) 
Choose a quantity that can be measured accurately from 
our simulations, 2) Use an estimator of Sf that does not 
rely on estimating the level of the unabsorbed quasar con- 
tinuum, 3) Identify an estimator that is insensitive to con- 
tamination from the unknown shape of the quasar contin- 
uum. 

Several different estimators for Sf have been discussed 
in the literature in the context of measuring the flux power 
spectrum (Hui et al. 2001, Croft et al. 2002). We now 
briefly describe these estimators. The number of photon 
counts, N p , in the Lya forest portion of a quasar spectrum 
is the product of a slowly varying quasar continuum, N c , 
with e~ T , (ignoring additional opacity from metal absorp- 
tion lines). Fluctuations in N p arising from fluctuations 
in eT T have to be separated out from those that arise due 
to structure in the quasar continuum. This can be ac- 
complished, at least on sufficiently small scales, since the 
quasar continuum, N c , varies smoothly in comparison to 
e _T . We proceed to discuss how this is done. The mean 
transmission, i.e., e~ T averaged over an ensemble of quasar 
spectra, (F/F c ) — (e~ T ), is not a function of wavelength 
for a sufficiently small stretch of a quasar spectrum. In 
spite of this, the mean number of photon counts depends 
on wavelength due to the slowly varying structure in the 
quasar continuum. This 'running mean' quasar count can 
be estimated from the spectrum by fitting a low order 
polynomial to the quasar spectrum (Hui et al. 2001), or 
by smoothing the spectrum with a large radius filter (Croft 
et al. 2002). Then Sf is formed by subtracting the running 
mean quasar count from the raw quasar count, finally di- 
viding out by the running mean. This procedure thereby 
offers a means of forming Sf and measuring its flux power 
spectrum without first estimating the normalization of the 
quasar continuum. 

The estimator we use for Sf in measuring the flux pdf is 
very similar to the estimator used by Croft et al. (2002) 
for measuring the flux power spectrum. In particular, we 
will demonstrate that Sf defined by: 

" /(A) = MX) (3) 
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satisfies the three desired criteria we outline above. In 
this equation / r (A) indicates the value of the flux field, /, 
at wavelength A when smoothed with a Gaussian filter of 
radius r - i.e., the variance of the Gaussian is a 2 = r 2 , 
and /fl(A) denotes the flux field at wavelength A when 
smoothed with a Gaussian filter of radius R. The filter, 
r, has a small radius, and serves to take out small-scale 
power on scales that are under-resolved in our simulation. 
In the Appendix we demonstrate that r = 30 km/s is 
an appropriate filter for our present simulations. The fil- 
ter, R, represents a large-scale smoothing that defines the 
running-mean flux in the quasar spectrum. Given a sam- 
ple of quasar spectra it is straightforward to estimate Sf 
as defined above, and measure its one-point probability 
distribution function. 

We carry out this procedure using spectra obtained with 
the High Resolution Echelle Spectograph (HIRES) on the 
Keck telescope. These quasar spectra are the same as 
described in Rauch et al. (1997) and in McDonald et al. 
(2000). (See these papers for references regarding the data 
reduction procedure.) The resolution of the quasar spectra 
is FWHM ~ 6.6 km/s and the typical signal to noise per 
0.04A pixel is S/N ~ 50. The quasar spectra, listed by 
name, emission redshift, and wavelength range examined, 
are shown in Table 2. Before measuring the flux pdf, but 
after performing the smoothing procedure described by 
Eqn. (3), we attempt to cut out metal lines, damped- lyman 
alpha systems, and spurious pixels, using wavelength cuts 
similar to those in Rauch et al. (1997) and McDonald et 
al. (2000). Unlike Rauch et al. (1997) and McDonald 
et al. (2000), we use only the 'raw' data, as opposed to 
continuum-fitted data, in making our measurements. 

We split the data sample into four relatively narrow red- 
shift bins, and measure the flux pdf in each redshift bin. 
The redshift bins are defined in Table 1 below. One of the 
redshift bins is chosen to have a mean redshift of z = 2.72, 
the central redshift of the Croft et al. (2002) sample. In 
the present paper, we will limit our theoretical analysis 
to this redshift bin, and we will use it to illustrate our 
procedure for measuring the flux pdf. 

After splitting our data sample into several redshift bins, 
we smooth each quasar spectrum in order to form Sf as 
described above, and then re-bin each spectrum into pixels 
of size 20 km/s. Data within 2, 000 km/s, (which corre- 
sponds to 4 times the radius of our large-scale smoothing 
filter, R), of the edge of each spectrum are cut after per- 
forming the smoothing to avoid edge effects. Upon forming 
the flux field from each quasar spectrum, we measure the 
pdf using 36 bins in Sf, each with a width of 0.061538. The 
center of the lowest flux bin corresponds to Sf = —1.01538. 
In the event that a pixel has flux lower (higher) than the 
lower (upper) edge of the smallest (largest) Sf bin it is 
included in the lowest (highest) Sf bin. We compute the 
average flux in each bin, and use this, rather than the bin 
center, when comparing with theoretical models, as allows 

Next, we discuss our procedure for estimating error bars. 
In order to measure the variance and covariance of our 
estimates of the pdf of Sf, we use a jackknife technique. 
Specifically, we 1) estimate the pdf from the full data sam- 
ple, 2) divide the data set into n g = 30 different subgroups, 
and 3) we estimate the pdf of the data sample omitting 
each sub-group. Let P(Si) represent the pdf estimated 



from the full data sample for a bin with average flux, Si. 
In addition, let Pk{Si) represent the same quantity, es- 
timated not from the full data sample, but from a data 
sample that omits the pixels in the kth subgroup. Then 
the jackknife estimate of the covariance between the esti- 
mates of the pdf in a bin with average flux, Si, and a bin 
with average flux, 5j is given by: 

n g 

Cov(i,j) = J2 [Pfa) - Pk(Si)] [PiSf) - PkiSj)] (4) 
fe=i 

and the diagonal variance, for a bin with flux Si, is of = 
Cov(i, i). 

In order to check the accuracy of our jackknife estimate 
of the error bars, we generate mock spectra using the log- 
normal model described in McDonald et al. (2004a). 8 This 
provides a test of our error estimation procedure: first 
we measure the dispersion across many independent real- 
izations generated using our lognormal simulations, and 
second we make a jackknife estimate from a single real- 
ization. To the extent that the jackknife method agrees 
with error bars estimated from many independent realiza- 
tions, we can be confident that it is sound. We illustrate 
our procedure for constructing mock realizations of the 
data considering the redshift bin centered on (z) = 2.72 
as an example. The corresponding procedure for the other 
redshift bins is very similar. First we generate four lines 
of sight, corresponding to the number of lines of sight in 

8 The lognormal model we adopt is slightly different than that of 
McDonald et al. (2004a). Specifically, McDonald et al. (2004a) go 
from the lognormal 'density field', A, to an optical depth field with 
the transformation r oc A 2 . Here we instead adopt r <x A 1,79 , and a 
slightly different proportionality constant to relate optical depth to 
density. If the temperature-density relation is T = Tq(1 + 5) a , our 
choice corresponds to a = 0.3, while McDonald's choice corresponds 
to the isothermal case, a = 0.0. We find that this provides a better 
fit to the pdf, while retaining a rough match to the observed redshift 
evolution of the mean transmitted flux in the Lya forest. 




Fig. 2. — A demonstration that our lognormal model roughly 
reproduces the pdf of the data. 
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Table 1 

Redshift Bins for PDF measurements 



^low ^high 



2.20 2.45 2.26 

2.46 3.00 2.72 

3.01 3.70 3.28 

3.71 4.43 3.99 



our actual data sample that fall into the (z) = 2.72 bin. 
Each line of sight spans the length of our redshift bin, 
z — 2.46 — 3.00, on a fine grid of 16384 pixels. We form 
Sf according to Eqn. (3), coarse-sample onto pixels of size 
20 km/s, and cut out regions of spectra until the mock 
spectra have exactly the same wavelength coverage as the 
actual spectra. In Figure 2, we show a measurement of 
the pdf from many realizations of our lognormal model as 
compared to a measurement of the pdf from real data. The 
comparison illustrates that the lognormal model roughly 
reproduces the pdf of the actual data, although there are 
some differences between the model and data. We em- 
phasize that we only use the lognormal model to 1) test 
our jackknife estimates of the variance of the pdf, 2) in- 
dicate the correlation coefficient between our estimates of 
the pdf in different flux bins. This second step is nec- 
essary because, as we detail below, our estimates of the 
off-diagonal elements of the covariance matrix from our 
present data set are too noisy to be useful. In the future, 
with larger data samples, we can just estimate the covari- 
ance matrix directly. 

How well does the jackknife technique described above 
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Fig. 3. — Comparison of two estimates of the error bars on the 
flux pdf, as measured from lognormal simulations. The first error 
bar estimate comes from the dispersion across many independent 
realizations, and the second is a jackknife estimate. The fractional 
difference between the fractional error bars is shown. 



work? In Figure 3 we compare our jackknife estimate of 
the fractional error on the flux pdf with an estimate from 
the dispersion across many simulation realizations. The 
jackknife error estimate is generally good to better than 
20%, but seems to produce a systematic overestimate of 
the error at low Sf. The apparent overestimate of the er- 
rors is at least conservative in the sense that parameter 
constraints would only tighten if we were to adopt smaller 
error bars on these points. On the other hand, the error 
bars on the points in the high Sf tail may be underesti- 
mated using the jackknife estimator. In this case, there are 
so few pixels in the high Sf tail of the pdf that our jack- 
knife method is unreliable. To overcome this difficulty we 
again adopt a conservative approach and use the variance 
measured from the lognormal simulations to represent the 
variance for the high Sf bins in our measurement from real 
data. 

Next, we consider the off-diagonal terms in the co- 
variance matrix. We find that our jackknife estimates 
of the off-diagonal elements of the covariance matrix are 
quite noisy. After all, we are attempting to measure 
36 x 37/2 = 666 matrix elements from only 3499 data 
pixels. To get around this difficulty, we first measure the 
correlation coefficient from many realizations of the mock 
lognormal spectra. The correlation coefficient is defined 
by r(i,j) = Cov(i,j)/y/Cov(i, i)Cov(j,j). Next we as- 
sume that the real data has the same correlation coeffi- 
cient as the mock data, and estimate the data covariance 
from the simulated correlation coefficient r{i,j) and the 
jackknife estimate of the data variance, i.e. Cov^(i,j) = 

r s{i, j)\/Covd{i, i)Covd(j, ])■ Here the subscript 'd' indi- 
cates a quantity measured from real data, while the sub- 
script 's' represents a quantity measured from the lognor- 
mal simulations. This approach allows us to estimate the 
off-diagonal terms in the data covariance matrix, which 
are significant, but cannot be estimated cleanly from the 
data alone. The approach is justified to the extent that 
the correlation coefficient in the lognormal simulation is 
representative of the true correlation coefficient in the real 
data. In Figure 4 we show a comparison of the correla- 
tion coefficient measured from the data with a jackknife 
estimator, which is noisy, and that measured from the log- 
normal simulations, which is smooth. From the figure, one 
can see that the simulation measurement is fairly consis- 
tent with a smooth version of the noisy jackknife estimate, 
justifying our approach. 

This procedure then provides us with an estimate of 
the full covariance matrix. There is one last detail that 
we need to address before proceeding with the remaining 
analysis. The difficulty is that the flux pdf is required to 
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satisfy a normalization condition, J d5fP(St) — 1, and the 
data vectors are hence not linearly independent, and the 
covariance matrix is nearly singular. We therefore diago- 
nalize the covariance matrix, and rebuild it by eliminat- 
ing the eigenvector with the smallest eigenvalue. In other 
words, the rebuilt covariance matrix can be written as 



Ci 



E 

fc=i 



UikUjkXk, 



(5) 



where CV, is the 'regularized' covariance matrix, U is the 
unitary matrix that diagonalizes the covariance matrix, 
and Afe is the kth eigenvalue of the covariance matrix. The 
eigenvalues are sorted by size, with Ai denoting the largest 
eigenvalue. The sum extends up to N m = 35, i.e. - N m is 
the number of bins minus 1. This 'regularized' covariance 
matrix can then be stably inverted and used to compute 
X 2 when comparing theoretical models with data (see §4). 

Our estimates of the flux pdf and its error bars in 
the fiducial redshift bin are given in Table 4. The flux 
pdf and error bars for the other redshift bins (see Ta- 
ble 1) are given in Tables 5-7. In Figure 5 we show 
our pdf measurement in the different redshift bins. The 
pdf evolves as one expects - a larger fraction of pix- 
els in the spectrum become opaque with increasing red- 
shift. Tables with the covariance matrices for each red- 
shift bin can be obtained electronically from the website 
http://t8web.lanl.gov/people/heitmann/lyma/. 

With our estimates of the flux pdf and its covariance 
matrix in hand, we now demonstrate that our estimator 
satisfies the criteria we described in the beginning of this 
section. The first criterion, that the estimator be inscnsi- 
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Fig. 4. — We show, on the one hand the noisy jackknife estimates 
of the correlation coefficient from the data (solid lines), and on the 
other hand, simulation estimates of the correlation coefficient (dot- 
ted lines). The red line is the correlation coefficient for a bin with 



an average 



c flux of <5 



f 



-0.832, the blue line has an average flux of 



5f = 0.0326, and the magenta line 5f = 0.769. The simulation mea- 
surements are fairly consistent with a smooth version of the noisy 
jackknife estimate from real data. 
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Fig. 5. — Measured flux pdf in redshift bins with mean redshifts 
of (z) = 2.26,2.72,3.28, and 3.99. 



tive to the numerical resolution limits of the simulations, 
is established in the Appendix. The second criterion is 
clearly satisfied, since the estimator defined in Eqn. (3) 
does not refer to the unabsorbed continuum level. 

The third criterion, that our estimator is insensitive to 
the shape of the quasar continuum, is not obvious. In order 
to investigate this, we estimate the pdf of Sf from the 'raw' 
data as well as from a version of the data that has first 
been continuum-normalized, as described in Rauch et al. 
(1997). If our procedure of applying a large-scale filtering 
to form Sf is robust to the unknown continuum shape, then 
we should get very much the same pdf whether we measure 
the pdf using the raw data or the continuum-fitted data. 

We illustrate our approach in Figure 6 where we mea- 
sure the pdf of Sf, for each of the raw data and the 
continuum-fitted data, in two different ways. First, we 
measure the pdf from each of the raw and the continuum- 
fitted data, assuming a flat mean. In this case the pdf of 
Sf differs substantially between the continuum-fitted and 
the raw data. The continuum has power on large scales, 
and the flux pdf will differ significantly between the raw 
and the continuum-normalized data if we don't account 
for this. Second, we measure the pdf from each of the raw 
and the continuum- fitted data, with Sf formed using the 
R = 500 km/s smoothing to define Sf as described above. 
In this case the pdf of Sf is extremely similar between 
the continuum-normalized and raw data. In one case we 
have measured the pdf of Sf from data that has first been 
continuum-normalized, and in the second case we did not 
perform any continuum-normalization, effectively assum- 
ing that the continuum is flat. The similarity between the 
pdf of Sf under these two very different assumptions re- 
garding the true quasar continuum, demonstrates that the 
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Fig. 6. — Flux pdf from continuum fitted data (red histogram) and 
from raw data (black points with error bars). The flux pdfs in the 
top panel are formed assuming a fiat mean to estimate <5 * , while in 
the bottom panel 5f is defined using a large-scale smoothing of R = 
500 km/s. The error bars on the pdf measured from the continuum 
fitted data, which are comparable to those from the raw data, are not 
shown for visual clarity. The close agreement between the pdf of the 
flux field measured from the raw data and that measured from the 
continuum-fitted data, when both are smoothed on a scale of Ft = 
500 km/s, illustrates that our procedure is robust to the continuum. 
The disagreement between the flux pdfs in the top panel, formed 
assuming a flat mean, is due to large-scale power in the continuum. 



pdf ofSf is insensitive to our assumptions about the quasar 
continuum. To provide a quantitative measure of the simi- 
larity between the pdf measured in the two different ways, 
we compute \ 2 between the two probability distributions. 
We find, comparing our 36 Sf bins, and including estimates 
of the off-diagonal elements in the covariance matrix, that 
X 2 = 8.3. The pdf of 5/ for the continuum- fitted and raw 
data are therefore in close agreement. 9 

4. CONSTRAINTS ON THE MASS POWER SPECTRUM 
USING THE FLUX POWER SPECTRUM AND THE FLUX 
PDF 

The next step, after demonstrating that we can measure 
the flux pdf robustly, is to show that the flux pdf is useful 
in constraining the mass power spectrum, when employed 
in conjunction with the flux power spectrum. In particu- 
lar, we emphasize that the flux pdf is sensitive to A, the 
parameter related to the amplitude of the ionizing back- 
ground, and hence we can place constraints on the mass 
power spectrum, without making any assumptions about 
the mean flux in the Lya forest. 

In Figure 7 we show our pdf measurement at z — 2.72 

9 One might worry that this value of x 2 is actually too small. 
However, one should keep in mind that we are comparing two prob- 
ability distributions measured from the same data, yet with different 
data reduction procedures. 



as well as theoretical predictions for the pdf for the mod- 
els shown in Figure 1, each of which fits the observed flux 
power spectrum, in spite of having very different matter 
power spectra. While the flux pdfs of each model look 
fairly similar by eye, one can distinguish between them 
statistically. Of these models, the minimum \ 2 obtained 
from fitting to the PDF alone, occurs at (F/F c ) = 0.720, 
with x 2 = 38.88. The fit is reasonable given that we com- 
pare the theoretical flux pdf with the measured flux pdf for 
~ 34 degrees of freedom (36 data points - 1 normalization 
condition - 1 prior on To), and so the fit roughly corre- 
sponds to x 2 l v ~ 1-1; which should be exceeded randomly 
~ 26% of the time. The model with a lower mean flux of 
(F/F c ) = 0.680 is a significantly worse fit with x 2 = 43.26. 
Similarly models with significantly higher mean transmit- 
ted flux are also poor fits to the data. Specifically, the 
model with (F/F c ) = 0.750 has x 2 = 42.46, and the model 
with (F/F c ) = 0.770 has X 2 = 52.86. 

The key point is that it is easy to find two models that, 
while differing significantly in their photo-ionizing back- 
grounds, yield the same flux power spectrum (and hence 
have the same variance) - however these two models will 
generally differ in their higher moments. In particular, the 
model with (F/F c ) = 0.680 has a variance of (Sj) = 0.134 

and a skewness of S = {S s f ) / (Sj) 3 / 2 = -0.975. The model 
with (F/F c ) = 0.770 has a nearly identical variance, but 
its skewness is rather different, S = —1.24. To clarify the 
situation, let us consider a simplified argument for how the 
information in the skewness complements the information 
in the flux power spectrum. The pdf of Sf is mainly de- 
scribed by two parameters, its variance (S 2 ), and its skew- 
ness, S — (<5/}/(<5/) 3 / 2 - The underlying model, on the 
other hand, mainly depends on the variance of the density 
field, smoothed on some scale, (S 2 ), and the amplitude 
of the ionizing background, parameterized by A, or effec- 
tively (F/F c ). Now it is easy to imagine that one can 
adjust A to match the flux variance, (S 2 ) for density fields 
with very different variances, (5?). However, the skewness 
depends on the variance of the density field, (S 2 ), and on 
A in a different way than the flux variance (S 2 ), as one can 
see in Figures 10-12 of Gaztanaga & Croft (1999). Hence, 
the information in the skewness provides an effective way 
of breaking the degeneracy between the amplitude of the 
ionizing background and the amplitude/slope of the mat- 
ter power spectrum. 

Having demonstrated the sensitivity of the flux pdf to 
(F/F c ), we constrain (F/F c ) at z = 2.72 by marginal- 
izing over all of the other parameters. The results are 
shown in Figure 8, the reduced likelihood function for 
(F/F c ) using joint constraints from the flux power spec- 
trum and the flux pdf. The allowed 1-cr range is (F/F c ) = 
0.730 + 0.007 - 0.027, while the allowed 2-cr range is 
(F/F c ) = 0.730 + 0.014 - 0.038. The quoted central 
value, (F/F c ) = 0.730, corresponds to the mean trans- 
mitted flux that maximizes the likelihood function. The 
reduced likelihood function with the current data set is 
hence rather broad, only weakly constraining values of 
the mean transmitted flux less than our central value. 
However, we reiterate that our constraints on the mean 
transmitted flux do not rely on 'continuum fitting ' and are 
completely independent of the usual measurements. Next 
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Fig. 7. — Flux pdf of the models shown in Figure 1, illustrat- 
ing that the flux pdf is helpful in breaking the degeneracy between 
the mean transmission and the amplitude/slope of the mass power 
spectrum. One should keep in mind that the error bars in different 
8f bins are correlated. 



we compare our constraints with the priors adopted on 
the mean transmitted flux in the Lya forest adopted by 
other authors. The prior adopted by Croft et al. (2002) 
is (F/F c ) = 0.705 ± 0.012, the prior of Viel et al. (2004b) 
is (F/F c ) = 0.730 ± 0.011, and SMM03 consider two pri- 
ors on the mean transmitted flux: (F/F c ) = 0.742 ±0.012, 
and one with expanded error bars, (F/F c ) — 0.742 ±0.027. 
Our results are thus intermediate between the Croft et al. 
(2002) prior and the SMM03 prior, and do not strongly 
constrain either possibility. From Figure 8, one can see, 
however, that our constraint is narrower than the prior 
adopted by SMM03. 

These constraints are conservative in the sense that we 
assume very little about the shape and amplitude of the 
linear matter power spectrum. We can turn things around 
by requiring ACDM and examining how tight the con- 
straints on the mean transmitted flux become. Specifically, 
we adopt a 5% prior on the slope of the linear power spec- 
trum, and a 10% prior on the amplitude of the linear power 
spectrum, centered around a scale-invariant model with 
as{z = 0) = 0.9. The constraint on the mean transmission 
(2-<r) in the Lya forest becomes (F/F c ) = 0.724 ± 0.012. 
This can be compared to the (2-cr) constraint we ob- 
tain without any prior on the matter power spectrum, 
(F/F c ) = 0.730 + 0.014 - 0.038. The prior on the mat- 
ter power spectrum thus disfavors the very small values of 
the mean transmission that are allowed without the prior. 

How tight do the constraints on the slope and the ampli- 
tude of the mass power spectrum become, when assuming 
nothing about the mean transmission but using the flux 
pdf? In Figure 9 we show the constraint on the mass power 
spectrum amplitude and slope using the information from 
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Fig. 8. — Reduced likelihood function for the mean flux, (F/F c ), 
marginalized over all of the other parameters at z = 2.72. The solid 
red line is our constraint from combining the flux pdf with the flux 
power spectrum. The green dotted line is the 'expanded error' prior 
suggested by SMM03, based on local continuum- fitting estimates of 
the mean transmitted flux. The black short-dashed line is the prior 
adopted by Croft et al. (2002) based on the PRS measurements of 
the mean transmitted flux. The blue long-dashed line is the prior 
adopted by Viel et al. (2004) based on their own local continuum- 
fitting estimates of the mean transmitted flux. 

both the flux pdf and the flux auto spectrum. One can 
see that while the constraints on the mass power spec- 
trum amplitude and slope are still fairly weak, using the 
flux pdf as well as the auto spectrum significantly tightens 
the constraints. The flux pdf disfavors the high values of 
the mean transmitted flux that fit the flux power spectrum 
with high amplitude mass power spectrum normalizations. 
The information in the flux pdf thereby helps to break the 
degeneracy between the mean transmitted flux and the 
mass power spectrum normalization/slope by tightening 
constraints on the mean transmitted flux. In the Appendix 
we estimate the systematic error on our constraints result- 
ing from the limited resolution of our simulations. 

5. CONSTRAINTS ON THE SLOPE OF THE 
TEMPERATURE-DENSITY RELATION 

In this section we illustrate that our measurement of 
the flux pdf also yields a constraint on the slope of the 
temperature-density relation, a, at z = 2.72. The re- 
lation between flux and density depends on the slope of 
the temperature-density relation [Eqn. (1)]. We find that, 
when all of the other modeling parameters are fixed, the 
probability of a pixel having no Lya absorption becomes 
smaller as a increases (Gaztanaga & Croft 1999). This 
distinction is somewhat washed out by the smoothing we 
apply to the spectra. In spite of this we still find a con- 
straint on the slope of the temperature-density relation, 
a. In Figure 10, we show the reduced likelihood func- 
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Fig. 9. — The red contours show 1- and 2-<r constraints on the mass 
power spectrum using both the probability distribution function of 
6f and the auto spectrum, without assuming any prior on the mean 
transmitted flux in the Lya forest. The black contour shows the 
2-<r constraint on the matter power spectrum using only the auto 
spectrum, without enforcing any prior on (F/F c ). 



Fig. 10. — Reduced likelihood function for the slope of the tem- 
perature density relation, a. The red solid line shows the likelihood 
function for a from comparing theoretical models with the observed 
flux power spectrum and flux pdf at z = 2.72. The blue dotted 
line illustrates that our results only weaken slightly if we relax our 
adopted prior on the temperature at mean density. 



tion for the parameter a obtained by marginalizing over 
all of the other parameters. From the figure one can see 
that a ;> 0.4 is required by our measurements at the 2-er 
level. We checked that this constraint does not depend 
significantly on our assumed prior on the temperature at 
mean density, T = 300 ±50 (km/s) 2 . Relaxing this prior, 
the constraint only weakens to requiring a J> 0.32 at 2-a. 
This constraint is surprising if Hell is reionized at redshifts 
slightly larger than z ~ 2.72, in which case we expect the 
slope of the temperature-density relation to be close to 
isothermal (Hui & Gnedin 1997). We caution however, 
that we only considered one thermal history in calculat- 
ing the gas pressure smoothing in our HPM simulations. 
Desjacques & Nusser (2004) find that the effect of varying 
the amount of gas pressure smoothing on the statistics of 
the Lya forest is degenerate with the effect of varying the 
slope of the temperature-density relation. Our constraint 
on the temperature-density relation is likely to weaken if 
we properly marginalize over the amount of gas pressure 
smoothing. 

6. DISCUSSION 

In this paper we have demonstrated that the informa- 
tion in the flux pdf, when used in conjunction with the 
flux power spectrum, can tighten constraints on the mat- 
ter power spectrum. In particular, the flux pdf of 5f is 
robust to our treatment of the quasar continuum, yet sen- 
sitive to the amplitude of the photo-ionizing background, 
a sensitivity that allows us to break degeneracies between 
the matter power spectrum and the amplitude of the ion- 



izing background. Hence, we can constrain the matter 
power spectrum without relying on measurements of the 
mean transmitted flux which depend on estimating the 
normalization of the quasar continuum. 

It is appropriate to consider the prospects for using this 
methodology to obtain tighter constraints on cosmology 
and the physics of the IGM in the future, especially since 
our present constraints are still quite weak. Observation- 
ally, the pdf of Sf can be measured with very high sta- 
tistical precision using the SDSS (Burgess, Buries et al., 
2005, in prep.). Theoretically, two main improvements can 
be expected. First, we will soon be able to simulate the 
same cosmological volume at higher resolution. This will 
allow us to utilize the small-scale information obtained 
in the flux power spectrum and the flux pdf and elimi- 
nate systematic errors due to poor resolution. Second, our 
pseudo-hydrodynamic approach should be compared sys- 
tematically with full hydrodynamic simulations (Gnedin & 
Hui 1998, Meiksin & White 2001, McDonald 2004b, Viel 
et al. 2005b). Alternatively, it may be feasible to run a 
grid of fully hydrodynamic simulations in the near future. 

There are a few other issues that would be interesting 
to pursue. First, the methodology described here can be 
applied at a range of redshifts to constrain the evolution 
of the slope of the temperature density relation, the mean 
transmitted flux and the amplitude of the ionizing back- 
ground, as well as the slope and amplitude of the linear 
matter power spectrum. Our methodology may actually 
be most useful at high redshift, where we did not compare 
our measurements with theoretical models. At high red- 
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shift, continuum- fitting should be most problematic since 
there are then zero, or very few, unabsorbed regions of 
spectra with which to define the continuum. Applying our 
method over a range of redshifts would also provide con- 
straints on the evolution of the linear matter power spec- 
trum with rcdshift, offering a test of the extent to which 
the structure in the Lya forest is governed by gravitational 
instability (Croft et al. 2002). Second, the probability 
distribution of Sf might be useful in constraining primor- 
dial non-gaussianity (Gaztanaga & Croft 1999). Third, we 
can generalize our measurements to examine the pdf as a 
function of smoothing scale. Finally, the flux pdf might be 
useful in constraining the impact of non-gravitational pro- 
cesses on the Lya forest. Gravitational instability makes 
definite predictions for the relationship between the vari- 
ance of the density field and its higher order moments - 
to the extent that the Lya forest is dominated by gravita- 
tional instability, these scalings should be imprinted on the 
flux statistics in the Lya forest (Zaldarriaga et al. 2001b, 
Fang & White 2004). The good agreement of our mea- 
sured flux pdf and simulations should therefore imply a 
constraint on feedback processes. 
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APPENDIX 

The primary purpose of this section is to examine the 
sensitivity of our results to limitations in our simulation 
box size and resolution. The ACDM simulations in this 
paper use 512 3 baryons and 512 3 dark matter particles, 
with 512 3 mesh points, in a 40 Mpc/h box. The simu- 
lations were run with a fast, parallel N-body code, MC 2 
(Mesh-based Cosmology Code). MC 2 was recently ex- 
tensively tested against five other state-of-the-art codes 
(Heitmann et al. 2004). A detailed description of the 
code will be given in Habib et al. (2005) (in prepara- 
tion), including a description of our implementation of 
HPM, the pseudo-hydrodynamic method of Gnedin & Hui 
(1998). Our implementation differs from that of Gnedin 
& Hui (1998) in that we follow separately two particle 
species: baryonic particles that experience gas forces as 
well as gravity, and dark matter particles that interact 
only gravitationally. We adopt the input transfer func- 
tion of Hu and Sugiyama (1996) with cosmological param- 
eters of VL m = 0.3, flA = 0.7, Qbh 2 — 0.02. Simulations 
with differing spectral indices, n, are each normalized so 
as to have the same amplitude, A 2 (k p ,z), on the scale 
k p — 0.008 s/km at a redshift of z = 3. The output with 
n = 1.0 has a normalization given by as(z = 0) = 0.84. 
Finally, we need to specify the thermal history of the IGM 
in order to compute the gas pressure term in our HPM 
simulation. We adopt a similar thermal history to that 
adopted in McDonald & Miralda-Escude (2001). Specifi- 
cally, we assume that T is 25, 000 K and that the slope of 



the temperature-density relation is a = at a reionization 
redshift of z rcion = 10, while T = 18,000 K and a = 0.3 
at z = 4. We linearly interpolate between these values to 
specify the gas pressure at intermediate redshifts, and use 
the z = 4 values at lower redshifts. While the true ther- 
mal history of the IGM may be rather different than this 
simple model, we find that the flux power spectrum on 
the large scales considered in this paper, k <; 0.02 s/km, 
are relatively insensitive to the thermal history. More sys- 
tematic tests of the dependence of our results on thermal 
history are clearly warranted, however. 

In Figure 11, we show flux power spectrum measure- 
ments at z = 3 for simulations with box sizes of 20, 40 and 
80 Mpc/h. Each simulation has a resolution equivalent 
to that of a 2 x 512 3 particle, 80 Mpc/h simulation. For 
the purpose of testing the resolution and box size of our 
simulations, we generate flux fields at z — 3.0 for a model 
with (a,n,k f ,T ,a,(F/F c )) = (0.2480,1.0,300 (km/s) 2 , 
0.4,0.684). For this model, the box size convergence test 
shown in the figure indicates that any difference in the 
flux power spectrum between the 40 Mpc/h simulation 
and the 80 Mpc/h is at most comparable to the (1-cr) sta- 
tistical error on the Croft et al. (2002) measurement, and 
always less than the 2-tr statistical errors shown in the fig- 
ure. Furthermore, much of the difference between the 40 
Mpc/h simulation and the 80 Mpc/h simulation is likely 
random, as opposed to systematic, owing simply to differ- 
ences in the random initial conditions adopted in the two 
simulations. Therefore, a 40 Mpc/h simulation box is ad- 
equate to achieve convergence at the level of the Croft et 
al. (2002) error bars. 

In Figure 12, we show the equivalent convergence test 
for the flux pdf. In particular, it is important to confirm 
that our results converge with box size, in spite of the size 
we choose for our large-radius filter, R = 500 km/s, which 
is roughly l/8th the size of the 40 Mpc/h simulation box. 
The figure illustrates that any difference between the flux 
pdf in a 40 Mpc/h simulation box and that in a 80 Mpc/h 
simulation box is very small. 

Another important aspect of our modeling is the choice 
of the smoothing scale, r, at which we smooth the data 
and simulated spectra in order to measure the pdf of 5f. 
We aim to choose this scale to be sufficiently large that 
we can guarantee the convergence of the flux pdf, when 
the flux fields are smoothed on the scale r, with increas- 
ing resolution. To test the convergence of the flux pdf 
with resolution we have generated the flux pdf assuming a 
range of different small-scale smoothings, r, in simulations 
of varying resolution. (The varying resolution simulations 
are described in more detail below.) In Figure 13 we show 
the convergence of our simulation measurements with in- 
creasing resolution for small-scale smoothings of r = 5 
km/s and r = 30 km/s. As expected, the convergence is 
better in the case of the r = 30 km/s filter than in the case 
of the r — 5 km/s filter. In the case of the r = 30 km/s 
filter it is clear that the convergence of our results is ade- 
quate for our present data sample. The convergence also 
appears surprisingly good in the case of the r = 5 km/s 
filter: the difference between the 256 3 , 40 Mpc/h and the 
512 3 , 40 Mpc/h simulation is only slightly larger than the 
statistical error bars. However, in our present analysis we 
conservatively adopt the r = 30 km/s filter. 

Next, we examine the convergence of the flux power 
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Fig. 11. — Convergence test of the flux power spectrum with 
box size at z = 3.0. In the top panel, we show the flux auto 
spectrum in 20,40 and 80 Mpc/h simulation boxes. Each model 
has the same resolution, corresponding to 2 X 128 3 , 2 X 256 3 , 
and 2 X 512 3 particles, respectively. The model is described by 
(a,n,T ,a,{F/F c )) = (0.2480,1.00,300 (km/s) 2 , 0.4,0.684). The 
bottom panel shows the fractional difference between the flux power 
spectrum in each of the 20 and 40 Mpc/h simulation boxes with that 
in the 80 Mpc/h simulation box. The green points show the size of 
the (2-cr) fractional error bars from Croft et al. (2002) at z = 2.72. 

spectrum with resolution. To perform this test, we mea- 
sure the flux power spectrum for several simulations of 
the same box size, but differing particle number. Specif- 
ically, we measure the flux power spectrum in 40 Mpc/h 
simulations with each of 2 x 128 3 , 2 x 256 3 and 2 x 512 3 
particles. Each of the 40 Mpc/h simulations are generated 
with similar initial conditions: the initial conditions for, 
e.g., the 2 x 256 3 simulation are generated by averaging 
the initial conditions for the 2 x 512 3 particle simulation 
over every eight particles. We also generate a 2 x 512 3 
particle, 60 Mpc/h simulation to further illustrate the con- 
vergence of our results with varying mesh size. This sim- 
ulation is generated with different initial conditions than 
that of the 40 Mpc/h simulations, but Figure 11 demon- 
strates that the resulting random differences between the 
flux power spectra should be small for the /c-modes of in- 
terest, k ;> 0.01 s/km. Finally, we extrapolate our results 
to the case of infinite resolution using Richardson extrap- 
olation. We make this extrapolation in two ways. First we 
assume that the convergence of the flux power spectrum 
for a given k-mode is linear in the mesh size, and second 
we assume that the convergence is quadratic in the mesh 
size. The results of our resolution test are shown in Fig- 
ure 14. Interestingly, the convergence appears better in 
the flux pdf than in the auto spectrum, although this is 
partly due to the large error bars on the pdf measurement; 
our pdf estimate comes from a smaller data sample than 



Fig. 12. — Convergence test of the flux pdf with box size at 
2 = 3.0. The simulation flux fields have been smoothed with filters 
of R = 500 km/s and r = 30 km/s. The flux pdf is shown in 
simulation boxes of size 20,40 and 80 Mpc/h, each with the same 
resolution. 

the sample Croft et al. (2002) use to estimate the flux 
power spectrum. At any rate, the test shows rather large 
systematic differences between the lowest resolution simu- 
lation, with 2 x 128 3 particle and the 2 x 512 3 particle, 40 
Mpc/h simulation. These differences are expected since 
the lowest resolution simulations do not resolve the gas 
pressure smoothing scale. The measurements made using 
the 2 x 256 3 , 40 Mpc/h simulation and the 2 x 512 3 , 60 
Mpc/h simulation are substantially closer to those from 
the 2 x 512 3 , 40 Mpc/h simulation. 

How close is the flux power spectrum in our 512 3 , 40 
Mpc/h simulation to what we would measure from an in- 
finite resolution simulation? This depends, of course, on 
how we extrapolate our measurements to infinite resolu- 
tion. If the convergence is as good as quadratic in the mesh 
size, then our present simulations are adequate: any sys- 
tematic difference between the 2 x 512 3 , 40 Mpc/h simula- 
tion and the quadratic extrapolation to perfect resolution 
is small compared to the statistical error bars on the Croft 
et al. (2002) measurement. However, if the convergence is 
only linear in the mesh size, then the figure illustrates that, 
even limiting our analysis to scales of k <J 0.02 s/km, we 
are making some non-negligible systematic error. Specifi- 
cally, assuming linear convergence, our 2 x 512 3 , 40 Mpc/h 
flux power spectrum is good to ~ 10%. While this conver- 
gence appears to be quite acceptable, it is in fact compa- 
rable to the 2-cr statistical error on the Croft et al. (2002) 
measurement at z — 2.72. It is important to decide which 
is a more accurate description of the convergence of our re- 
sults with mesh size: linear or quadratic convergence? At 
present, the best we can do to answer this question is to 
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Fig. 13. — Convergence test of the flux pdf with resolution at 
2 = 3.0. We show the flux pdf in a 40 Mpc//i simulation box for 
simulations with 2 X 128 3 , 2 X 256 3 , and 2 X 512 3 particles, as well as 
the flux pdf in a 60 Mpc/h simulation with 2 X 512 3 particles. The 
panels on the left side show the flux pdf for a small-scale smoothing 
of r = 5 km/s and the panels on the right hand side show the flux 
pdf for a small-scale smoothing of r = 30 km/s. In each case the flux 
fields were smoothed with the same large radius filter of R = 500 
km/s. The top panels show the measurements themselves, while 
the bottom panels show the fractional difference between the lower 
resolution measurements and the high resolution measurement. The 
green points represent the approximate (1-u) fractional error from 
the present data sample. 

ask how well we can predict the results of the 2 x 512 3 , 40 
Mpc//i simulation by extrapolating from the lower resolu- 
tion simulations. On this basis, we find that neither linear 
nor quadratic convergence is a good description over the 
full range of fc-modes we consider. At the fc-modes that 
our most important for our convergence study, k ~ 0.02 
s/km, by making the (worse-case) assumption that linear 
convergence is an accurate description, we can check how 
large a systematic error we are possibly making due to 
imperfect simulation resolution. 

In order to gauge the importance of any systematic error 
from the limited resolution of our simulations we do the 
following check. We assume that the convergence of our 
flux power spectrum measurement is linear in the mesh 
size, and derive a correction factor at each fc-mode by di- 
viding the linearly-extrapolated flux power by that in our 
2 x 512 3 particle, 40 Mpc/h simulation box (see also Mc- 
Donald ct al. 2004b) . Assuming that this correction factor 
is model-independent, we then multiply each model in our 
grid by this correction and examine how much our con- 
straints on the slope and amplitude of the matter power 
spectrum change. This is clearly an imperfect procedure: 
the relevant correction factor is likely model-dependent 
and our assumption of linear convergence is not strongly 



Fig. 14. — Convergence test of the flux auto spectrum with reso- 
lution at z = 3.0. We show the flux auto spectrum in a 40 Mpc//i 
simulation box for each of 2 X 128 3 , 2 X 256 3 , and 2 X 512 3 parti- 
cles. We also show the flux auto spectrum in a 2 X 512 3 particle, 
60 Mpc/ft simulation box, and two extrapolations of the flux auto 
spectrum to infinite resolution. In one extrapolation we assume that 
the flux power in each fc-mode converges linearly with the mesh size, 
and in the other extrapolation we assume that the flux power con- 
verges quadratically with the mesh size. The top panel shows the 
flux power, while the bottom panel shows the fractional difference 
with the 512 3 , 40 Mpc/h simulation. The green points show the 
size of the (2-ct) fractional error bars from Croft et al. (2002) at 
z = 2.72. In our present analysis, we only include measurements on 
scales larger than that indicated by the vertical dashed line. 

established. In spite of these uncertainties, this test gives 
some indication of how much our results might change with 
improved resolution. This is shown in Figure 15, where 
the black contours show our current constraints, and the 
red contours indicate our estimate of the same constraints 
given simulations of infinite resolution. From this plot, 
the main effect of our limited resolution appears to be to 
a systematic shift in the inferred slope of the matter power 
spectrum towards steeper slopes by ~ 0.1. The 'resolution 
corrected' contours are also less elongated towards large 
matter power spectrum amplitudes. From this test, the 
effect of limited resolution appears to be most degener- 
ate with the slope of the matter power spectrum (see also 
SMM03). While our present constraints are weak, it is 
clear that higher resolution simulations will be necessary 
to obtain more precise constraints in the future. 
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Fig. 15. — An estimate of the systematic error in our constraints 
on the linear matter power spectrum owing to the imperfect reso- 
lution of our numerical simulations. The black contours indicate 1- 
and 2-(j constraints on the slope and amplitude of the linear mat- 
ter power spectrum. The red contours are estimates of the same 
constraints we expect given infinite resolution simulations. 
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Table 2 

Quasar Spectra and Wavelength Range used in 
the PDF Measurement 







-•Miighl/ 1 ,) 


Q2343+123 2.52 


3922.0 


3926.6 




3929.3 


3966.5 




3970.5 


3996.3 




4000.5 


4053.2 




4060.6 


4077. 




4097. 


4108.8 


Q1442+293 2.67 


3764.4 


4089.5 




4095.1 


4100.2 




4105.5 


4156.2 




4204.5 


4250.0 




4336.8 


4375.5 




4379.6 


4409.28 


Q1107+485 3.00 


4233.0 


4261.3 




4262.7 


4306.1 




4308.8 


4364.7 




4365.4 


4408.5 




4409.5 


4434.5 




4437.5 


4444.8 




4446.5 


4485.7 




4491.9 


4534.8 




4541.9 


4549.4 




4550.9 


4656.3 




4660.3 


4670.1 




4678.9 


4698.2 




4701.3 


4741.3 




4744.9 


4797.9 


Q1425+604 3.20 


4314.0 


4320.7 




4321.9 


4373.0 




4375.5 


4462.5 




4467.5 


4486.0 




4490.6 


4492.3 




4502.7 


4546.0 




4571.9 


4591.3 




4593.8 


4599.0 




4603.5 


4606.6 




4753.9 


4821.1 




4829.0 


4858.1 




4862.3 


4905.4 




4910.0 


4916.3 




4918.0 


4980.5 




4996.9 


5032.0 


Q1422+230 3.62 


4738.1 


4786.4 




4818.0 


4884.5 




4885.7 


4890.6 




4892.2 


4980.7 




4982.2 


5002.3 




5003.7 


5224.3 




5225.8 


5287.3 




5288.5 


5364.6 




5369.0 


5472.4 




5477.0 


5523.0 



Note. — Tabic continued on the next page. The quasar spectra 
used in the pdf measurement. The wavelength range specified indi- 
cates the pixels that are used in the analysis, other data are excluded 
due to the expected existence of metal lines, damped lyman-alpha 
systems, or spurious pixels. See also Rauch et al. (1997) and Mc- 
Donald et al. (2000). 
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Table 3 

Quasar Spectra and Wavelength Range used in 
the PDF Measurement (continued) 



Quasar Spectrum 


2cm 


Alow (A) 


Ahigh(A) 


Q000-262 


4.11 


5600.0 


5637.0 






5639.8 


5704.4 






5706.4 


5715.2 






5718.2 


5724.9 






5727.2 


5813.5 






5815.3 


5859.7 






5860.8 


5953.5 






5956.4 


6092.0 


Q2237-061 


4.55 


5692.0 


5703.8 






5705.0 


5747.4 






5749.7 


5774.4 






5774.8 


5803.3 






5805.7 


5815.3 






5817.6 


5830.4 






5831.5 


5876.9 






5878.1 


5891.1 






5892.0 


5897.0 






5898.0 


5967.0 






5967.2 


5974.8 






5975.6 


6020.1 






6020.8 


6034.6 






6036.5 


6042.7 






6055.5 


6063.4 






6068.9 


6100.0 






6250.0 


6251.6 






6255.4 


6282.3 






6290.3 


6314.5 






6315.5 


6330.6 






6332.3 


6402.3 






6405.8 


6482.3 






6485.8 


6502.0 






6503.5 


6549.1 






6550.0 


6582.1 






6582.2 


6600.0 



Note. — Table continued from the previous page. The quasar spec- 
tra used in the pdf measurement. The wavelength range specified 
indicates the pixels that are used in the analysis, other data are ex- 
cluded due to the expected existence of metal lines, damped lyman- 
alpha systems or spurious pixels. See also Rauch et al. (1997) and 
McDonald et al. (2000). 
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Table 4 

The probability distribution of Sf at (z) — 2.72. 



Bin No. Sf P(S f ) <J 2 P (6 f ) Bin No. 6 f P(S f ) *%(6 f ) 



1 


-0.9983E+00 





.3483E+00 


0. 


1406E- 


■01 


19 


0.9415E-01 


0.1964E+01 


0. 


.5558E- 


■01 


2 


-0.9596E+00 


0. 


.3390E+00 


0. 


4917E- 


■02 


20 


0. 


1526E+00 


0.1872E+01 


0. 


.3540E- 


■01 


3 


-0.8913E+00 


0. 


1718E+00 


0. 


1371E- 


■02 


21 


0, 


2142E+00 


0.1714E+01 


0. 


5379E- 


■01 


4 


-0.8317E+00 





.2090E+00 


0. 


1600E- 


■02 


22 


0. 


2739E+00 


0.1235E+01 


0. 


.4550E- 


■01 


5 


-0.7706E+00 





.1858E+00 


0. 


1213E- 


■02 


23 


0. 


3366E+00 


0.6548E+00 


0. 


1588E- 


■01 


6 


-0.7013E+00 


0. 


1486E+00 


0, 


.1105E- 


■02 


24 


0. 


.3985E+00 


0.4273E+00 


0. 


,8478E- 


■02 


7 


-0.6450E+00 


0. 


1765E+00 


0. 


.9211E- 


■03 


25 


0. 


.4584E+00 


0.3530E+00 


0. 


.7458E- 


■02 


8 


-0.5853E+00 


0. 


.2276E+00 


0. 


1547E- 


■02 


26 





5206E+00 


0.1533E+00 


0. 


1817E- 


■02 


9 


-0.5259E+00 


0. 


.2322E+00 


0. 


1491E- 


■02 


27 


0. 


.5837E+00 


0.1486E+00 


0. 


1890E- 


■02 


10 


-0.4593E+00 


0. 


.2647E+00 


0, 


1542E- 


■02 


28 


0. 


.6443E+00 


0.1207E+00 


0. 


1695E- 


■02 


11 


-0.4016E+00 


0. 


.3669E+00 


0. 


2094E- 


■02 


29 


0. 


.7081E+00 


0.5109E-01 


0. 


5303E- 


■03 


12 


-0.3379E+00 





.3715E+00 


0. 


.2788E- 


■02 


30 


0. 


7693E+00 


0.6966E-01 


0. 


7736E- 


■03 


13 


-0.2766E+00 


0. 


.3437E+00 


0. 


.1596E- 


■02 


31 


0, 


.8312E+00 


0.5109E-01 


0. 


.7147E- 


■03 


14 


-0.2161E+00 





.3622E+00 


0. 


2797E- 


■02 


32 


0. 


8806E+00 


0.1393E-01 


0. 


6471E- 


■04 


15 


-0.1519E+00 


0. 


.5387E+00 


0. 


.3878E- 


■02 


33 


0. 


9382E+00 


0.2322E-01 


0. 


.2808E- 


■03 


16 


-0.9288E-01 


0. 


6966E+00 


0. 


7684E- 


■02 


34 


0. 


1014E+01 


0.4644E-02 


0. 


.3863E- 


-04 


17 


-0.2913E-01 





.8545E+00 


0. 


.1101E- 


■01 


35 


0. 


1091E+01 


0.4644E-02 


0. 


.2454E- 


-04 


18 


0.3326E-01 


0. 


1547E+01 


0. 


.7748E- 


■01 


36 


0. 


1130E+01 


0.4644E-02 


0. 


9799E- 


■04 



Note. — The one point probability distribution of flux as a function 
of Sf. The first four columns are respectively the bin number, the 
average Sj in the bin, the average pdf in the bin, and a jackknifc 
estimate of the error in the pdf. The fifth through eighth columns 
are the same as the first four columns for higher flux bins. 



Table 5 

The probability distribution of Sf at {2) = 2.26. 



•in No. 


S f 

J 




P(Sf) 




a%(Sf) 


Bin No. 


S f 

J 


P(Sf) 


a%(5f) 


1 


-0.1015E+01 


0. 


1382E+00 


0. 


.5478E-02 


19 


0.8905E-01 


0.2683E+01 


0.1241E+00 


2 


-0.9534E+00 


0. 


1219E+00 


0. 


.2655E-02 


20 


0.1557E+00 


0.1894E+01 


0.5844E-01 


3 


-0.8912E+00 


0. 


2032E+00 





.8366E-02 


21 


0.2122E+00 


0.1528E+01 


0.8463E-01 


4 


-0.8349E+00 


0. 


1138E+00 


0, 


1236E-02 


22 


0.2716E+00 


0.8292E+00 


0.3612E-01 


5 


-0.7647E+00 


0. 


1382E+00 


0. 


1511E-02 


23 


0.3373E+00 


0.2764E+00 


0.9017E-02 


6 


-0.7096E+00 


0. 


1382E+00 


0. 


1374E-02 


24 


0.3928E+00 


0.1707E+00 


0.8643E-02 


7 


-0.6471E+00 


0. 


1301E+00 


0. 


.1234E-02 


25 


0.4600E+00 


0.9755E-01 


0.3620E-02 


8 


-0.5816E+00 


0. 


1301E+00 


0. 


1092E-02 


26 


0.5145E+00 


0.1219E+00 


0.8451E-02 


9 


-0.5251E+00 


0. 


1301E+00 


0. 


1057E-02 


27 


0.5767E+00 


0.9755E-01 


0.4756E-02 


10 


-0.4579E+00 


0. 


1707E+00 





1856E-02 


28 


0.6210E+00 


0.8129E-02 


0.6836E-04 


11 


-0.4060E+00 


0. 


2114E+00 


0. 


.1800E-02 


29 


0.0000E+00 


0.0000E+00 


0.0000E+00 


12 


-0.3429E+00 


0. 


1707E+00 


0. 


1859E-02 


30 


0.0000E+00 


0.0000E+00 


0.0000E+00 


13 


-0.2701E+00 


0. 


2520E+00 





.3739E-02 


31 


0.0000E+00 


0.0000E+00 


0.0000E+00 


14 


-0.2121E+00 


0. 


3821E+00 


0. 


.5326E-02 


32 


0.0000E+00 


0.0000E+00 


0.0000E+00 


15 


-0.1502E+00 


0. 


4796E+00 


0. 


.4609E-02 


33 


0.0000E+00 


0.0000E+00 


0.0000E+00 


16 


-0.9019E-01 


0. 


7560E+00 


0. 


2314E-01 


34 


0.0000E+00 


0.0000E+00 


0.0000E+00 


17 


-0.2765E-01 


0. 


1366E+01 


0. 


.5216E-01 


35 


0.0000E+00 


0.0000E+00 


0.0000E+00 


18 


0.3238E-01 


0. 


3512E+01 


0.3496E+00 


36 


0.0000E+00 


0.0000E+00 


0.0000E+00 



Note. — The same as Table (4), except for the rcdshift bin with 
(z) = 2.26 (see Table 1). 



Table 6 

The probability distribution of Sf at {2) = 3.28. 



a No. 


Sf 




P(Sf) 






Bin No. 




Sf 


P(Sf) 




^p(Sf) 


1 


-0.9947E+00 


0. 


.2902E+00 


0, 


1450E- 


■01 


19 


0.9358E-01 


0.1049E+01 


0. 


2613E- 


•01 


2 


-0.9543E+00 


0. 


4551E+00 


0. 


.8468E- 


■02 


20 


0. 


1538E+00 


0.1194E+01 


0. 


2766E- 


■01 


3 


-0.8906E+00 





.4353E+00 


0. 


2913E- 


■02 


21 


0. 


2154E+00 


0.1372E+01 


0. 


5276E- 


■01 


4 


-0.8287E+00 


0. 


.3232E+00 


0. 


.5159E- 


•02 


22 


0. 


.2761E+00 


0.1200E+01 


0. 


1899E- 


■01 


5 


-0.7692E+00 





.3034E+00 


0. 


.2115E- 


■02 


23 


0. 


3388E+00 


0.9695E+00 


0. 


1769E- 


•01 


6 


-0.7084E+00 


0. 


.3100E+00 


0. 


.2483E- 


•02 


24 


0. 


.3984E+00 


0.7584E+00 


0. 


.1251E- 


■01 


7 


-0.6432E+00 


0. 


.3166E+00 


0. 


.3222E- 


•02 


25 


0. 


.4634E+00 


0.6529E+00 


0. 


1901E- 


■01 


8 


-0.5831E+00 


0. 


.2704E+00 


0. 


1906E- 


•02 


26 


0. 


.5220E+00 


0.4221E+00 


0. 


4633E- 


■02 


9 


-0.5225E+00 





.2770E+00 


0. 


2094E- 


-02 


27 


0. 


5860E+00 


0.2770E+00 


0. 


.4519E- 


■02 


10 


-0.4599E+00 


0. 


.3363E+00 


0. 


3365E- 


■02 


28 


0. 


.6418E+00 


0.3297E+00 


0. 


6822E- 


■02 


11 


-0.4017E+00 


0. 


.3891E+00 


0. 


2566E- 


■02 


29 


0. 


7016E+00 


0.1451E+00 


0. 


.3897E- 


■02 


12 


-0.3419E+00 





.4023E+00 


0. 


2653E- 


02 


30 


0. 


7641E+00 


0.7914E-01 


0. 


.1172E- 


-02 


13 


-0.2799E+00 





.4814E+00 


0. 


4446E- 


■02 


31 


0. 


.8174E+00 


0.1319E-01 


0. 


.8699E- 


■04 


14 


-0.2151E+00 





.4155E+00 


0. 


.4118E- 


■02 


32 


0. 


9020E+00 


0.3957E-01 


0. 


5024E- 


■03 


15 


-0.1488E+00 


0. 


.4814E+00 


0. 


.3175E- 


■02 


33 


0. 


9599E+00 


0.6595E-01 


0. 


1799E- 


■02 


16 


-0.9270E-01 


0. 


.6265E+00 


0. 


.4565E- 


•02 


34 


0. 


.1018E+01 


0.1978E-01 


0. 


3634E- 


■03 


17 


-0.2955E-01 


0. 


6925E+00 


0. 


.6691E- 


•02 


35 


0. 


.1087E+01 


0.3957E-01 


0. 


.2377E- 


■03 


18 


0.3172E-01 


0. 


.7848E+00 


0. 


1571E- 


01 


36 


0. 


1140E+01 


0.3297E-01 


0. 


1672E- 


•02 



Note. — The same as Table (4), except for the redshift bin with 
(2) = 3.28 (see Table 1). 



Table 7 

The probability distribution of Sf at {2) = 3.99. 



a No. 


Sf 




P{S f ) 




°%{Sf) 


Bin No. 




Sf 


P(Sf) 






1 


-0.1016E+01 


0. 


.7323E+00 


0, 


4294E- 


•01 


19 


0.9680E-01 


0.5300E+00 


0. 


6503E- 


■02 


2 


-0.9549E+00 





.7253E+00 


0. 


1136E- 


■01 


20 


0. 


1511E+00 


0.5789E+00 


0. 


5608E- 


•02 


3 


-0.8961E+00 


0. 


4394E+00 


0. 


3914E- 


•02 


21 


0, 


.2174E+00 


0.5789E+00 


0. 


.7551E- 


•02 


4 


-0.8317E+00 


0. 


.4882E+00 


0. 


.5043E- 


•02 


22 


0. 


.2751E+00 


0.5440E+00 


0. 


7219E- 


■02 


5 


-0.7688E+00 


0. 


4673E+00 


0. 


3830E- 


•02 


23 


0, 


3359E+00 


0.5161E+00 


0. 


.3712E- 


■02 


6 


-0.7093E+00 


0. 


.4673E+00 


0. 


.4368E- 


•02 


24 


0. 


.3970E+00 


0.5370E+00 


0. 


3745E- 


■02 


7 


-0.6479E+00 


0. 


.4254E+00 


0. 


.3701E- 


■02 


25 


0. 


.4651E+00 


0.4882E+00 


0. 


.3472E- 


■02 


8 


-0.5825E+00 


0. 


3906E+00 


0. 


.4653E- 


•02 


26 


0. 


.5239E+00 


0.4533E+00 


0. 


9987E- 


•02 


9 


-0.5201E+00 





.3906E+00 


0. 


4975E- 


■02 


27 


0. 


.5851E+00 


0.3696E+00 


0. 


.2578E- 


■02 


10 


-0.4614E+00 





.3627E+00 


0. 


3310E- 


•02 


28 


0. 


6461E+00 


0.5021E+00 


0. 


,8142E- 


•02 


11 


-0.3998E+00 


0. 


.4603E+00 


0. 


3210E- 


■02 


29 


0. 


7059E+00 


0.4254E+00 


0. 


2967E- 


-02 


12 


-0.3389E+00 


0. 


.4324E+00 


0, 


.5804E- 


■02 


30 


0. 


7680E+00 


0.3975E+00 


0. 


.7950E- 


•02 


13 


-0.2751E+00 





.3766E+00 


0. 


2644E- 


■02 


31 


0. 


8286E+00 


0.3208E+00 


0. 


.4973E- 


•02 


14 


-0.2171E+00 


0. 


.4603E+00 


0. 


5976E- 


02 


32 


0. 


.8864E+00 


0.2511E+00 


0. 


.2959E- 


■02 


15 


-0.1496E+00 


0. 


.4742E+00 


0. 


.3926E- 


•02 


33 


0. 


.9561E+00 


0.1883E+00 


0. 


.2852E- 


•02 


16 


-0.9083E-01 


0. 


.4394E+00 


0. 


.4400E- 


•02 


34 


0. 


1019E+01 


0.8369E-01 


0. 


1274E- 


■02 


17 


-0.2869E-01 


0, 


.4533E+00 


0. 


.5302E- 


•02 


35 


0. 


1067E+01 


0.1186E+00 


0. 


1113E- 


■02 


18 


0.3169E-01 


0. 


.5440E+00 


0. 


.6711E- 


■02 


36 


0. 


1463E+01 


0.8369E+00 


0. 


1582E- 


■01 



Note. — The same as Table (4), except for the rcdshift bin with 
(2) = 3.99 (see Table 1). 



